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Measuring  the  Dependence  Between  Two  Point  Processes 
Through  Confidence  Intervals  for  the  Second  Order  Distribution 

by 

Hani  Doss 


ABSTRACT 

To  assess  the  dependence  structure  in  a  stationary  bivariate  point  process  the  second-order 
distribution  (Ripley  1976,  1977)  can  be  very  useful.  We  prove  that  the  natural  estimates  of 
this  distribution,  based  on  a  realization  Ax  <  A2  <  ...  <  AnA,Bx  <  B2  <  ...  <  Bno  are 
asymptotically  normal,  and  we  present  a  method  for  constructing  approximate  confidence  intervals 
for  this  distribution. 


Let  (Nx,Nb)  be  a  stationary  bivariate  point  process  on  R.  This  article  is  concerned  with 
statistical  methods  for  discovering  and  quantifying  an  association  between  the  two  processes  from 
a  realization  Ai  <  A*  <  ...  <  AnA,Bi  <  B2  <  ...  <  Bna  over  a  long  period  of  time  T. 
The  paper  is  motivated  by  certain  problems  that  arise  in  neurophysiology,  which  are  very  briefly 
described  as  follows  (for  further  details  see  e.g.  Bryant,  Ruiz  Marcos,  and  Segundo,  1973). 

Two  neurons,  A  and  B,  are  monitored  over  a  period  of  time  T  during  which  each  neuron 
fires  a  sequence  of  impulses.  The  problem  is  to  determine  whether  or  not  the  impulse  times  are 
associated.  An  association  between  N A  and  Nb  may  be  construed  as  evidence  that  either  the 
two  neurons  are  communicating,  or  that  they  both  share  input  from  a  third  source. 

Another  problem  arises  in  certain  neurophysiological  studies  of  learning  and  memory.  An 
animal  is  to  be  taught  (trained)  to  perform  a  certain  task.  Now  consider  two  “connected”  neurons, 
A  and  B,  which  are  essential  in  the  performance  of  this  task.  Record  the  impulse  times  during 
a  period  before  the  learning  experience,  obtaining  a  realization  of  {N  ,  N^ef  ),  and  during  a 
period  of  time  well  after  the  learning  experience,  obtaining  a  realization  of  (n*11 ',  N^ft  ).  The 
processes  N%e*'  and  may  be  dependent.  The  problem  is  to  determine  whether  or  not  this 

dependence  is  “stronger”  for  the  processes  N^1  and  N^1' .  A  neurophysiologist  may  consider 
a  change  in  the  strength  of  the  dependence  as  evidence  that  learning  has  taken  place. 

The  two  problems  have  very  different  statistical  character.  Let  S  be  a  statistic  that  “mea¬ 
sures”  the  dependence  between  two  point  processes.  The  first  problem  is  one  of  testing  the 
hypothesis  that  NA  and  Nb  are  independent,  and  requires  only  knowledge  of  the  distribution  of 
5  under  the  assumption  that  NA  and  Nb  are  independent.  The  second  problem  is  much  more 
difficult:  to  compare  5  across  two  situations  we  must  know  the  distribution  of  S  when  the  two 
point  processes  are  dependent. 

In  a  more  general  context,  Ripley  (1976,  1977)  introduced  a  measure  K ,  defined  on  an 
appropriate  space,  that  summarizes  the  second-order  properties  of  the  process.  Before  describing 
this  measure,  we  need  to  state  some  assumptions  and  introduce  some  notation.  Let  Ni(s ,  t)  denote 
the  number  of  events  of  type  »  occurring  in  the  interval  (s,t],  for  t  =  A,B.  Assume  that  each 
process  has  no  multiple  occurrences,  and  that  the  intensities 


are  finite.  (The  existence  of  these  limits  was  proved  by  Khintchine,  1960).  The  A*’s  then  have  an 
interpretation  as  mean  number  of  occurrences  per  unit  of  time:  for  <  t2, 


E  Ni{ti,t7)  =  Xiih-ti)  for  i  =  A,B 


(2) 


(this  follows  from  Dobrushin’s  Lemma  and  Korolyuk’s  Theorem;  see  Leadbetter,  1968). 

We  now  give  an  informal  description  of  the  measure  K ,  adapted  to  the  present  context. 

The  measure  K  is  defined  on  the  Bore]  subsets  of  R,  and  for  <  /2,  writing  /C(/ 1 , <2)  for 
we  have 


/f(<i,t2)  =  ’r~E{NA(t i,f2)  |  a  B  point  at  t  =  0} 

^=—-E{ND{-h,-ti)  |  an  A  point  at  t  =  0}^. 
Note  that  if  NA  and  ND  are  independent,  then 


(3) 


#(*i,<2)  —  h  ~ 


(4) 


regardless  of  the  values  \A  and  A x? ;  thus,  K  represents  Lebesgue  measure  on  R. 


Ripley  proposed  the  estimate  of  K(tl,t2)  given  by 


k(t  i,t2) 


T 

HaH-D 


ng  n  A 

Y]  -  b,  t  (<i,<2)}, 


»=i y=i 


(5) 


where  /{•}  denotes  the  indicator  function  (actually,  the  estimate  proposed  by  Riple  y  has  an  edge 
correction  for  points  near  the  boundary  of  the  period  of  observation;  this  edge  correction  will  not 
concern  us). 


Previous  work  on  the  estimator  K  is  concerned  with  spatial  processes.  The  results  center  on 

A 

using  K  to  test  that  a  single  process  in  Poisson  (Ripley,  1977;  Chapter  8  of  Ripley,  1981;  Silverman, 
1976)  and  on  using  K  to  test  for  independence  of  two  processes  (Lotwick  and  Silverman,  1982; 
Diggle  and  Milne,  1983). 


In  this  paper  we  study  the  asymptotic  properties  of  K{ti,t2)-  The  contributions  of  this  paper 
are  two-fold: 


first,  a  proof  that  under  certain  regularity  conditions,  as  no  — ♦  oo 
yfc(K(tuh)  ~  K(tift2))^}/(0ta7(tltt2))t 

and  second,  a  method  for  constructing  consistent  estimates  of  o2(tl,t2)- 


(6) 


(7) 


Besides  providing  the  basis  for  a  test  of  independence  between  NA  and  No,  (6)  and  (7)  enable 
one  to  test  whether  or  not  K(ti,t2)  has  changed  in  the  experimental  situation  described  earlier. 


The  cross-intensity  function  defined  by 

=  lim  P{Na(u  +  t,u  +  t  +  hi)  >  0;  NB(t,t  +  /i2)  >  0} 

hi,hi—o  nih2 


is  related  to  K  by 


^  X AD[u)du. 


(8) 


(9) 


Under  the  independence  hypothesis,  XAo (u)  =  XAXu  for  all  u.  Brillinger  (1976)  considered  the 
random  function 

jJb(u)  =  5^  «  (u-h,u  +  h)}  (10) 

i=l  J=l 

and  showed  that  under  suitable  regularity,  if  h  — *  0  and  T  — *  oo  in  such  a  way  that  hT  re¬ 
mains  constant,  then  for  -»  uk,\ul  -  u£|  >  2h,  1  <  k  <  k'  <  A 1,JAD(UI)  are  asymptoti¬ 
cally  independent  Poisson  random  variables  with  means  2hTXAu  (u*),  for  k  =  l,...,Af.  Thus, 
A ad{u)  =  can  use<*  *°  estimate  A Ap[u)  at  a  finite  number  of  points. 


In  practice  one  would  graph  the  two  functions  XAjj  and  K  over  a  finite  range,  say  [-L,  L\  (i.e. 
graph  K(  —  L,t)  for  —  L  <  t  <  L).  Although  from  a  mathematical  viewpoint  \Aa  and  I\  contain 
essentially  the  same  information,  the  statistical  properties  of  their  estimates  are  quite  different: 
estimation  of  XAD  is  akin  to  estimating  a  density,  and  from  Brillinger’s  result  the  variance  of 
XAd  >s  of  the  order  on  the  other  hand,  estimation  of  K  is  akin  to  estimating  a  distribution 
function,  and  from  (6),  the  variance  of  K  is  of  the  smaller  order  A  graph  of  \ad  may, 
however,  indicate  features  (spikes,  location  of  maxima  and  minima,  etc.)  that  cannot  be  easily 
seen  in  the  graph  of  K .  Clearly  the  two  approaches  are  complementary. 


Uab  (*1,^2)  =  E{NA(tut2)  |  a  B  point  at  t  =  0}. 


(11) 


We  may  estimate  UAB{t  1^2)  by 

UAB(ti,h)  =  —  ^2  I  [A]  ~  Bi  £  (ti,t2)}’  (12) 

nB  <= 1  y=i 

Letting 

A<  =  ^  for  *  =  A,  B,  (13) 

we  note  that 

E(ti,t2)  =  r—  Uab{1  i,h)t  and  ^(*1**2)  =  t—  Uab^u^)'  (14) 

*a  XA 

To  prove  asymptotic  normality  of  K  (Theorem  2)  we  will  prove  joint  asymptotic  normality  of 
ifi ab (t  1 1  ^2) >  Ax)-  We  will  in  fact  find  it  necessary  to  first  prove  joint  asymptotic  normality  of 
(^vU^ii^iAyi/Ai^l/Afl).  The  delta-method  (i.e.  a  first  term  Taylor  expansion)  applied  to  the 
function  f(x,y,z)  =  then  yields  the  asymptotic  normality  of  AT(<  1, <a).  We  also  obtain  the 
joint  asymptotic  normality  [UadI^i^^),  >>aAb)  by  applying  the  delta-method  to  the  function 
g(x,y,z)  =  (x,y/z,l/z). 

We  now  need  to  give  the  statistical  setting  of  our  asymptotic  investigation.  The  functions 
^AB^ifh)  and  E(titt2)  involve  the  notion  of  the  Palm  measure.  That  is  for  e  >  0,  we  consider 
the  conditional  distribution  of  the  process  ( Na,Nb )  given  that  there  is  a  B  point  in  the  interval 
(0,e),  and  take  the  limiting  distribution  of  ( NA}Nb )  as  z  — ♦  0.  Intuitively,  this  corresponds  to 
selecting  a  B  point  “arbitrarily”,  and  considering  the  process  with  that  point  labeled  the  origin. 
This  notion  is  discussed  for  univariate  processes  by  Leadbetter  (1972)  and  for  bivariate  processes 
by  Wisniewski  (1972).  We  will  assume  that  the  process  is  observed  during  a  period  of  length  T 
starting  immediately  after  the  occurrence  of  an  “arbitrary"  B  point,  say  Bo  (thus,  we  will  be 
working  with  the  Palm  measure).  This  mode  of  sampling  is  called  semisynchronous  sampling  by 
Cox  and  Lewis  (1972);  see  Wisniewski  (1972)  for  some  fundamental  properties  related  to  it.  Also, 
for  the  sake  of  convenience,  we  will  assume  that  the  period  of  observation  ends  with  a  B  point. 


M  *»»_ L*  *  t  *  l 


,1  »- 1  ■,!  i 


We  now  consider  two  subfields  of  the  cr-field  on  which  the  Palm  measure  is  defined.  Let  7-^ 
denote  the  a-field  generated  by  the  events 

{Bkl  f.  (B0  +  +  toi),.  .  .,B*m  £  (S0  4-  vm,B0  4-  wm); 

Na(B0  +  rlfSo  +  si)  =  Ai,...,A^(Bo  4-  rn,So  4-  <sn)  =  hn} 

for  v,-  <  wi  <  0,ki  =  -1,-2,...,*  =  l,...,m;ry  <  s3-  <  0,hj  =  0,1,2 =  l,...,n,  and  m 
and  n  nonnegative  integers.  For  u  >  0,  let  Jjf£+U  denote  the  ^-field  generated  by  the  events 

{Bkl  £  (So  +  vi,  So  4-  Wi),  •  •  • » Bktn  £  (So  4-  um,  Bo  4-  u>m); 

Na{Bq  4-  rj,  So  4-  si)  =  hi, ... ,  Na{Bo  4-  rn,  Bo  +  sn)  —  hn} 

for  u  <  Vi  <  Wi,ki  -  1,2,...,*  =  l,...,m;u  <  ry  <  sy,A,  =  0,1,2,..., j  =  1 . n,  and  m  and 

n  nonnegative  integers.  Let 

a(u)  =sup{|S(Sx  nS2)  -  P(S1)S(S2)|;S1  £  7^,E2  e  7%,+J.  (15) 

If  a(u)  — ►  0  as  u  — ►  oo,  then  the  distant  future  is  virtually  independent  of  the  past.  We  will 
actually  need  stronger  conditions  on  a(-). 

Let  /?>0,T7>l,0<r<lbe  any  constants  satisfying 


/  V  4-  1\ 


T  >  1. 


Assumptions: 


f°° 

A 1  /  [a(f)]Tf/?  dt  <  oo 

vo 

A2  sup  s{[ATD(y,J  +  1)]"  |  a  S  point  at  <  =  o}  =  D  <  oo 

—  oo<j  <oo  ' 

A3  s|[N>i(^i^2)4(1+t^7)  I  a  B  point  at  t  =  oj<  oo 
A4  s[(Si  -  S0)4(1  +  r^)]  <  oo 

r  1  4( 1  -4-  ) 

A5  S[W>l(So,S1)j  <  oo 


Before  stating  our  theorems  we  discuss  our  assumptions,  and  compare  them  with  those  of 
Brillinger  (1976).  Any  theorem  giving  asymptotic  normality  of  the  normalized  partial  sums  of  a 


■N  >*'■■■'  ■'  -  ‘N  -  "  >  *Vv'-'  -J1  'j  *-■  *  *•  ■-•'j'j'j  •-*>-»  V  V  v'j,  -J  'J.  J.  ■ 


stationary  sequence  {2’»}  must  assume  a  moment  condition  on  and  also  a  mixing  condition  on 
{Ti}.  In  general,  we.'  kening  of  the  moment  condition  must  be  compensated  by  strengthening  of 
the  mixing  condition,  and  vice  versa.  Assumptions  A3,  A4,  and  A5  provide  moment  conditions 
on  the  sequences  {Ui},{NA[Bi-uBi)},  and  {( Bi  —  J3* _ i ) } ,  respectively.  Assumption  A2  ensures 
that  the  B  process  “moves  along”  rapidly  enough  so  that  Al,  the  mixing  condition  imposed  on 
the  point  process,  translates  into  a  mixing  condition  for  the  sequences  {[/,},  {Na [Bi- 1 ,  Bi)},  and 
{(!?,•  —  Bi-i)}.  Relationship  (16)  describes  in  a  technical  way  the  interplay  between  the  mixing 
rate  on  the  point  process  and  the  moment  condition  on  the  sequences  {f/j},  {Na  (J9»-i,  Bi)},  and 

m  -  ft-.)}- 

The  conditions  assumed  by  Brillinger  (1976)  neither  imply  nor  are  implied  by  A1-A5  of  the 
present  paper.  Brillinger  assumes  a  mixing  condition  on  the  bivariate  point  process  and  also 
that  the  “second  order  moments”  A,y(-)  (i,j  =  A,B)  exist  and  are  continuous  (he  also  assumes 
existence  and  continuity  of  the  “third  and  fourth  order  moments”  ;  see  equation  (2.2)  of  his  paper). 
This  condition  on  Xad{')  is  not  satisfied  by  the  following  process:  Nb  is  a  Poisson  process,  and  Na 
is  Nb  shifted  to  the  right  by  1  unit.  In  this  case,  Aj4b(1)  =  oo.  This  process  does  however  satisfy 
A1-A5.  Conversely,  it  is  easy  to  find  processes  ( Na,Nb )  satisfying  all  of  Brillinger’s  conditions, 
but  not  those  of  the  present  paper.  Perhaps  the  simplest  example  is  the  following.  Let  NA  and 
Nb  be  independent,  Na  being  a  Poisson  process,  and  Nb  being  an  equilibrium  renewal  process 
on  (— oo,  oo)  (for  a  definition  and  a  construction  see  pp.  517-519  of  Karlin  and  Taylor,  1975)  with 
interarrival  distribution  having  a  first  moment  but  no  second  moment.  Then  A4  is  violated,  and 
it  is  not  difficult  to  check  that  this  process  satisfies  all  of  Brillinger’s  conditions. 

THEOREM  1.  Assume  Al  and  A2.  Let  Uyi,o(ii,t2),iyjt0(ti,f2),  and  A;  for  i  —  A,B  be  defined 
by  (11),  (12),  and  (13),  respectively. 

(i)  Under  A3,  we  have  as  n#  —*  oo 

y/nD{UAD{ti,t2)  -  U>ib(^i><2))— >A/(0,72(f i,t2))- 

Furthermore,  any  estimate  72(f i,f2)  of  the  form  (25),  satisfying  (27)  and  (28),  is  a 
consistent  estimate  of  72(ti,<2)- 

(ii)  Under  A4  and  A5  we  have  as  — ►  oo 

>/^d{Xa  ~  XaAd  ~  XbY^*M{0,A). 


Furthermore,  any  estimate  A  of  the  form  (57)  (refer  to  equations  (49)  and  (5l)-(56)), 
satisfying  (27)  and  (28)  is  a  consistent  estimate  of  A. 


(iii)  Under  A3-A5  we  have  asns  -*  oo 


Furthermore,  any  estimate  ^2{ti,t2)  the  f°rrn  (56)  (refer  to  equations  (49)  and  (51)- 
(55)),  satisfying  (27)  and  (28),  is  a  consistent  estimate  of  5Z(ii , <2) . 

PROOF  of  (i).  We  begin  by  showing  asymptotic  normality.  Let  U-  and  17,  be  defined  by 

OO 

U'i  =  c  (^»  +  +  <2)}  I{Bq  <  Aj  <  Bno} 

j=- 00 


and 


Note  that 


Ui=  £  I{Ai*{Bi  +  ti,Bi  +  t2)}. 


j=~oo 


u 


AB 


(«..«»)- A  £uj. 

nB  f-f 


(17) 


»=i 


It  is  clear  that  X)"=i  “  Xw=i  =  Oj»(l)>  Thus,  it  suffices  to  prove  the  result  with  Ui  s  instead 

of  U/’s  in  (17).  Observe  that  the  sequence  {U»}£1_00  is  stationary,  with  mean  UAn{t  i,t2)  and 
finite  variance  (by  A3).  The  t/,-’  s  may  be  far  from  independent:  for  small  k,Ui  and  £/,+*  may  be 
nearly  identical.  If,  however,  t/,-  and  Ui+k  are  “nearly  independent”  for  large  k,  then  one  can  still 
hope  to  have  a  Central  Limit  Theorem  effect.  The  proof  consists  of  translating  Al,  the  mixing 
condition  on  the  point  process,  into  a  mixing  condition  on  {17,}  that  allows  the  application  of  an 
appropriate  central  limit  theorem  for  stationary  sequences. 


Let  ji(&)  be  defined  for  k  =  1,2,...  by 


fi(k)  =  sup{|P(F?i  n  E2)  —  P(Ei)P(E2)\]  Ei  e  o{. . .  t/_„l/0),  E2  c  o{Uk,  Uk+U . . .)}  (18) 

(Here,  a{. . .  L/_  1 ,  U0)  denotes  the  a-field  generated  by  {. . . ,  t/_j,  Uo),  and  similarly  for 
<r(Uk,  Uk+u . .  •))•  The  function  n(-)  is  called  the  mixing  coefficient  of  the  sequence  {Ui}.  Our 
goal  is  to  prove  that  5ZkLi!M(fc))T  <  00 •  It  will  be  more  convenient  however,  to  show  instead  that 
X^Li(ji(2£)]T  <  00.  The  two  conditions  are  equivalent  since  /i(-)  is  nonincreasing. 


Let  k  >  1  be  fixed,  let  E  i  t  o(. . . ,  V-k),  E2  t  a(Ukt...),  and  consider  P(E\  n  £2).  Let 
C-k  =  {B.k  -  Bo  <  and  Ck  =  {Bk  -  B0  > 

We  may  write 

P(£i  n  E2)  =  P{(P1  n  c.k)  n  (£2  n  c*)}  +  P{(£x  n  F2)  n  (Cifc  uCtc)},  (19) 

where  c  denotes  complementation. 

Consider  the  first  term  on  the  right  side  of  (19).  For  all  large  k,  since  B-k  -  B0  <  — 
implies  that  B_fc  +  t2  <  Bq ,  we  have  Pin  CL*  e  Furthermore,  £2nCjt  £  ^D^+[jti/(/s+i)]+tl  • 

Therefore, 

P{(£i  n  C.k)  n  (P2  n  C*)}  <  P(El)P{E2)  +  +  tx).  (20) 

The  second  term  on  the  right  side  of  (19)  is  obviously  less  than  or  equal  to  P(CZk)  +  P(Ck). 
These  last  two  probabilities  are  dealt  with  in  the  same  way.  Consider  P(C£).  Observe  that 
P(Ck)  <  P{one  of  the  intervals  (Bo  +  j,  Bo  +  j  +  l),  j  =  0,1,...,  [A:1/,*/?+1^}  -  1 

(21) 

has  at  least  [A:^/^+1l]  points  }. 

By  A2,  Chebyschev’s  inequality  and  Boole’s  inequality,  the  right  side  of  (21)  is  less  than  or  equal 
to  [fc1/(/?+1l]  D  Combining  this  with  (20)  and  handling  the  opposite  inequality  in 

a  similar  way,  we  obtain 

/z(2 k)  <  a(klW+1)  +  <j)  +  *i/(/»+i>£)[jfcW+J )]-•». 

Assumption  Al  implies  that 

CO 

+ti)V  <  00.  (22) 

Combining  (22)  and  (16)  we  obtain  that  ]CfcLilM(2A:)]T  <  00,  and  hence  that 

OO 

X^(Jc)]T  <  °°*  (23) 

k= 1 

Assumption  A3  implies  that 

F[t/12(l+r^)]  <  00. 

This,  together  with  (23)  allows  us  to  apply  Theorem  1.7  of  Ibragimov  (1962)  to  conclude  that 
the  series  Cov(£/o,  t/j,)  converges  absolutely,  and  that  as  hr  — ♦  00 

~  UAo(ti,t2))^ >.V(0,72(fi,f2)), 


a  m  j 


we  have 


E!?[tl,ti)**lla{tLt2).  (33) 

From  now  on  we  drop  the  arguments  ti  and  t?  whenever  convenient.  The  triangle  inequality  gives 

Eli2  -  VI  <  £|i2  -  i’l  +  Eli’  -  li. I +  b’.  -  Vl-  (31) 

Thus,  our  objective  is  to  show  that  each  of  the  three  terms  on  the  right  side  of  (34)  converges  to 
0  as  ns  — < ►  oo.  It  is  easy  to  see  that  under  (27)  and  (28),  | -  'y2|  — ♦  0  as  — *  oo. 

We  now  consider  E \i2  —  Let  c0  =  5.  We  have 


M 


M 


E\ ^  -'ll, I  =  E\2  ^  ch{uK  -  i/k)\  <2^2  EWh  ~  *h\ 

h=0  h—0 

M  1  M 

<  2  ^{£(i>a  -  i/h)2}  =  2  ^2  (Var  vk}l. 


(35) 


0 


h-0 


We  now  examine  the  variance  terms.  It  is  well-known  (and  easy  to  see)  that  if  {}' }  is  a  stationary 
sequence,  then 

Var  (if 'Y<)=-  E  (l-  -)  Cov  (Yt,Yl+l).  (3C) 

i=l  |*|<n 

Equation  (30)  shows  that  vk  is  of  the  form  vk  =  ^  Fi,  where  Yt  —  (U,  -  {/)(£/,  +  /,  -  U), 

with  {Yi}  stationary.  We  can  thus  obtain  the  exact  variance  of 


Var 


1 

Uh  = - r 

nD  -  h 


E  0 


Kl 

n/j  -  h 


(37) 


where 


oM  =  Cov((f7,  -  UAD){Ui+h  ~  UAB),  (Ui+f  -  UAD)(Ul+h+,  -  UAD)).  (38) 


To  obtain  a  useful  bound  on  Var  uk  we  will  show  that 


M9I<  [d((|<| -*)+)]' c. 


(39) 


where  a+  =  max{a,0}  and  C  is  a  constant  not  depending  on  h  or  i.  The  key  ingredient  in 
the  proof  of  (39)  is  the  use  of  a  lemma  of  Ibragimov  (1962)  that  gives  an  explicit  bound  for  the 
covariance  of  two  random  variables  £  and  r)  satisfying  £  e  o(. . . ,  U-i,U0)  and  ri  1  o(Uk,Uk+ 1, . . .), 
in  terms  of  n[k)  and  certain  moments  of  £  and  77. 

10 


amaaiBareaia^^ 


•*.  V/v  ■ 


For  any  t  such  that  j/|  >  h,  Lemma  1  3  of  Ibragimov  (1962)  implies  that 


**(0  <  |m(|/1  -  h)]'{4  +  6  E\(Ut  -  VaB)(V„k  ~  ^B)!3,,+  ri7'} 


(<0) 


The  Cauchy- Sch wars  inequality  implies  that 


£|(t/.  -  UAB)(Ut+h  -  UaB)\«1  +  t^'  <  E |(£/.  -  t/4B)|4,,+ 


(41 


By  A3  the  right  side  of  (41)  is  a  finite  constant  not  depending  on  h  or  l  This  proves  (39)  for 
|f|  >  k.  For  |f|  <  k  the  proof  is  even  simpler,  and  is  omitted.  We  now  combine  (37)  and  (39)  to 
obtain 


Var  i>a  <  — -  ^  |oa(0i  v  — ^~T  (X]  +  XI 

no  -  .  nD  h  v  fr.  rr. 


1*1  <  »a  -a 


ri<a 


t  >h 


i  00 

<  -  ((2 h  f  DC  +  X.  M(/)  fc). 


142' 


y-t 


Next,  we  use  (23)  and  (27)  to  see  that  there  exists  an  integer  J,  not  depending  <m  h  nr  i. 
that 

-  a 

(43. 


Var  i \  <  nD^  for  all  np  '•  J 


We  now  return  to  E  -y*  I  (***  (25))  Combining  (27)  and  (43)  we  arrive  at 


En2  il.  ♦ « 


*>  Tip  ♦  OC. 


1 1 


To  complete  the  proof  of  (29)  we  need  only  show  that  /  V  v  •  0  as  n/ 

ki  w 

Ey  Y  E  2  XI  r/.(‘W,  i-/J  *  2  X  E  i'i 

We  will  presently  show  that 


X  W.  t.a'.e 


*  'h  ‘  l. 


I 


h  _o 


h-O 


<  •' 


r*n 


where  C 1  is  a  constant  not  depending  on  h 


Referring  to  equations  (26)  and  (30),  it  is  easy  to  see  that 


=  +  -UAB)(UA0  fOMf"21  (’adUI  at  n  ■  [f  rAPr. 

n 


(7 


where 


0(0  =  — l — 

nB  —  h 


»« -h- 1 


■ "  *  i  ••  B  * 

^  =  —  y.  v, 

1=0  t=A 

Thu.,  E\Ph  -  uK |  <  E\{0W  -  UAB)(UAB  -  0)\  +  E\(0W  -  UAB){UAB  -  0)\  +  E(0  -  UAB)2. 
Consider  first  E(0  -  UAB)*.  From  (36)  we  have 


w-uab)1^^-  Y  (1  - ir) 5 F ’ 

nB  ..rf  nB  nB 


|/|<ns 


91 

nB 


where  C"  =  j/q  +  2  t  |t/*|  <  oo.  Similarly, 


E(0^-Uab)7<  -,  and  E(V™  -  UAB)*  < 


nB  -  h  '  -  nB-h 

The  Cauchy-Schwari  inequality  now  yields  (46),  which  together  with  (27)  imply  that  £’|'7a  —  7a|  — < ► 
0  as  nB  — *  oo.  This  completes  the  proof  of  (i). 

We  next  prove  (iii),  since  the  proof  of  (ii)  proceeds  in  a  very  similar  way. 

PROOF  OF  (iii).  For  <  R,  let  X,  =  U  Ui  +  fc  NA{BitBi+l)  +•  $3(Si+i-  -B<).  The 

sequence  {X,}”.^  is  stationary,  and  if  jt(-)  denotes  its  mixing  coefficient,  it  is  clear  that  (23) 
holds  for  *(•)  as  well.  This  give  a  central  limit  theorem  for  {X,}.  It  is  simple  to  argue  that 
E  Na(B0,Bi)  -  XA/XB  and  that  E(Bi  -  Bo)  =  1/Ajj.  By  the  Cramer- Wold  device  we  now  have 
that 

-  - uAB.— -  X— -  r)‘  (48) 

\  n o  nB  XB  nB  XB  / 

is  asymptotically  normal  with  mean  0  and  covariance  matrix,  say  ty.  To  identify  ❖  and  describe 
consistent  estimates  of  it,  it  is  convenient  to  introduce  additional  notation.  Let 


Vtil)=u„  vl7)  =  NA(B^l$Bi),  W3>  =  Bi+1  -  Bi  for  »  =  ... -2,-1, 0,1, 2,...  (49) 
Let 

=Cov(V0(p',^,>)  for  p,q  =  1,2,3,  fc  =  ... -2,-1, 0,1, 2,...  (50) 

Note  that  1*  =  It  is  clear  that  the  asymptotic  covariance  matrix  of  (48)  is  equal  to  the 
matrix  whose  pqth  entry  is  ♦(*’*)  =  XThL-oo  (note  that  is  not  necessarily  equal  to 

unless  p  =  7).  Next,  for  p,q  =  1,2,3,  define 

E:=Vfc"1  (Vr,,P>-^(p))(K+i-^(,))  for  h  =  0, 1,2, . . . ,  M 


^  E*=V‘  (V.(P)  -  K(P»)(V.(;»  -  V(’>)  for  h  =,  —  1,  —  2,  -  3, . . . ,  -  M 


■  (v) 


(51) 


where 


»l0  —  1 


V (p)  =  —  V'  y.W# 

n  d  ■  ^  * 


n*  ££ 


4,(m)  _  C|fc|V»iP,)  for  ptq  =  1,2,3, 


h=-M 

where  M  and  {c/,}  satisfy  (27)  and  (28),  respectively.  Also,  let 

9  =  matrix  whose  pqth  entry  is  (54) 

The  same  argument  that  was  used  in  the  proof  of  (i)  now  applies  and  we  see  that  'I'  converges  to 
^  componentwise  in  probability  as  — >  oo. 

Consider  now  the  function  <?(x,y,«)  =  (i, y/2,  l/z),  which  maps 

Ui/nB ,nA/nB,T/n.B^J  into  ^Z?=i  j-  The  derivative  of  g  evaluated  at 

(x,y,z)  is 

100 

Dg[x,y,z)  =  (  0  J  7?).  (55) 

'0  0 

An  application  of  the  delta  method  (with  the  function  g )  to  the  vector  (48)  gives  the  asymptotic 
normality  result  asserted  in  Part  (iii)  of  the  theorem,  with 

Z(*  1>*2)  =  Z  =  Dg(UAB,^A/^B,l/^B)'&lDg{.UAB,^A/^B,l/^B)]'- 

Defining  Z  =  Z(*i><2)  by 

Z  =  Dg{UAB^Al^BAl^B)^f\Dg{UAB^Al^BAl^B)W  (56) 

it  is  clear  that  under  (27)  and  (28)  Z  converges  to  Z  componentwise  in  probability  as  tib  — *  00. 
PROOF  of  ii.  As  was  mentioned  above,  the  proof  of  (ii)  is  very  similar  to  that  of  (iii).  We 

(  \  (p<f) 

now  consider  the  variance  estimates.  Let  Z  M  and  Z  denote  the  pqth  entries  of  Z  and  Z> 
respectively,  and  let 

A  (g(32)  g(33)J’  ^ 

with  a  similar  definition  for  A.  Obviously,  this  is  the  same  A  that  appears  in  the  statement  of  Part 
(ii)  of  Theorem  1.  It  is  clear  that  under  (27)  and  (28),  as  n#  — ►  00,  A— t  A  componentwise  in  prob¬ 
ability.  This  completes  the  proof  of  Theorem  1.  I 


THEOREM  2.  Let  K[ti,t2)  and  K{titt2)  be  defined  by  (3)  and  (5),  respectively,  and  assume 
A1-A5.  Then,  as  nB  — ♦  oo 

Furthermore,  any  estimate  o2{ti,t2)  of  the  form  (58)  (refer  to  equations  (49)  and  (5l)-(54)), 
satisfying  (27)  and  (28),  is  a  consistent  estimate  of  er2(i1, *2)- 

PROOF.  We  apply  the  delta  method  to  the  vector  (48)  with  the  function  f(x,y,z)  =  — .  The 
derivative  of  /  at  the  point  (x,y,z)  is  Df(x,y,z)  =  (*,  7^,  *).  Evaluated  at  {UAb,Xa/Xb,  *7) 
and  this  is  (-£-,!(  Ajg)  and  (-p-, Kj*-,  K  \B),  respectively.  The  asymp- 

totic  normality  asserted  in  the  theorem  follows  from  the  asymptotic  normality  of  the  vector  (48), 
with 

°*(t uh)  =  (~,K^-,K  \„)  9  (~,K^S.,K  A „)' 

AA  AA  Aa  Aa 

and  it  is  clear  that  under  (27)  and  (28),  if  o2(tltt2)  is  defined  by 

o2(tut2)  =  Afl)  *  XBy  (58) 

aa  aa  aa 

then  as  nB  — »  00 ,o2(ti,t2)  converges  to  o2{t\.,t2)  in  probability.  This  completes  the  proof  of  The¬ 
orem  2.  | 

Results  giving  the  asymptotic  normality  of  estimates  of  \A  and  \B  (under  varying  sets  of 
assumptions)  already  exist  in  the  literature;  see  e.g.  Theorem  8.6  of  Daley  and  Vere-Jones  (1972). 
It  was  necessary  to  establish  joint  asymptotic  normality  of  XA  and  UAB{h,h)  in  order  to  obtain 
asymptotic  normality  of  AT(ti,t2)- 

We  now  discuss  the  choice  of  the  constants  M  and  ci,c2,. . .  ,cm,  which  enter  into  the  esti¬ 
mates  ”72  and  $  given  by  (25)  and  (53),  respectively,  and  for  the  sake  of  simplicity,  our  discussion 
is  in  terms  of  72  only.  It  is  appropriate  to  discuss  the  choice  of  these  constants  within  the 
framework  of  spectral  density  estimation.  Defining 

1  1  00 

/M  =  ir1' 0  +  -  UhC0S  uh » 

2?r  ir 

h—l 

we  see  that  72  =  2w/(0).  To  estimate  /(a/)  we  must  effectively  estimate  ///,  for  each  h.  For  fixed 
riB^h.  may  be  estimated  for  h  =  0,1,...  ,nB  —  1.  However,  because  of  the  fact  that  for  fixed  nB 


the  variance  of  Oh.  increases  with  h,  it  is  standard  to  consider  estimates  of  the  form 


1  1  M 

fH  =  7-^0  +  -  ^2  Chuh  cos. 


where  M  is  much  smaller  than  ng,  and  cj,  decreases  as  h  increases.  For  a  given  value  of  M 
the  constant  c are  usually  given  by  c*  =  u>(£)  for  some  function  w  (called  the  “lag  window”) 
defined  on  (0,1),  satisfying  the  following:  u>(0)  =  l,to(l)  =  0,  and  w  decreases  smoothly.  Two 
commonly  used  choices  are  the  Blackman-Tukey  and  the  Parzen  windows;  see  pp  514-516  of 
Anderson  (1971)  for  a  definition  of  these.  Also,  see  Chapter  9  of  Anderson  (1971)  for  a  general 
discussion  of  estimation  of  the  spectral  density.  It  is  clear  that  as  M  increases  the  bias  of  /( w) 
decreases  while  its  variance  increases.  For  both  the  Blackman-Tukey  and  the  Parzen  windows  as 
well  as  for  most  of  the  commonly  used  windows,  a  value  of  M  of  the  order  is  usually  used,  since 
(under  certain  conditions  on  the  stationary  series)  this  minimizes  the  asymptotic  mean  squared 
error.  See  Section  9.3.4  of  Anderson  (1971).  Thus,  condition  (27)  is  not  at  all  restrictive. 

It  should  be  noted  that  the  part  of  the  proof  of  Theorem  1  that  gives  the  consistency  of 
/( 0)  applies  equally  well  to  the  estimates  /(w)  for  any  w,  and  similarly  for  the  spectral  density 
estimates  of  the  series  {NA(B<+t,  B<)}  and  {J9,>  j  -  B,}. 

A 

It  was  necessary  to  give  a  proof  of  the  consistency  of  /( 0)  because  the  currently  available 
consistency  results  for  spectral  density  estimates  are  valid  under  conditions  on  {Ui}  that  are  not 
implied  by  Assumptions  Al-  A5  (e.g.  existence  of  all  moments  in  Brillinger,  1975;  {t/,}  is  a  linear 
process  as  in  Anderson,  1971,  and  in  Hannan,  1970). 


The  methods  described  in  this  paper  enable  the  construction  of  asymptotic  confidence  inter¬ 
vals  for  K(ty,t 2),  for  fixed  values  of  fj  and  <2-  The  function  K(-,  )  will  usually  be  of  interest  over 
a  continuum  of  values,  say  —L<ti<t2<L,  where  L  is  some  number  much  smaller  than  T. 
One  can  plot  K(-L,t)  for  —  L  <  t  <  L  or,  what  is  sometimes  more  useful,  plot  K{t  -  +  ~)  for 

- L  +  <  t  <  L  —  d.  Here,  d  is  some  small  number  representing  the  experimenter’s  guess  at  the 

duration  or  likely  duration  of  the  effect  of  a  B  point  on  the  A  process.  The  function  K(t  -  +  fj) 

is  identically  equal  to  d  if  NA  and  Ng  are  independent. 
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We  may  form  the  bands 


K(—L,t)  ±  o(—L,t)/y/nB  -  L<t  <  L 

and 

where  d(fi,f2)  is  an  estimate  of  tr(ti,ti)  and  is  the  upper  ^  •  100  percentile  point  of  a 

standard  normal  variable.  These  bands  of  course  are  not  simultaneous  confidence  bands.  To  form 
simultaneous  confidence  bands  one  would  need  to  carry  out  two  distinct  steps: 

(i)  Establish  weak  convergence  of  the  processes 

^».(0  =  -  K(-L,t)) 

and 

w'-.M  =  VS?  m  -  f.«  +  f  )-*(«-  f,«  +  5)) 

to  Gaussian  processes  V(t)  and  W(f),  respectively. 

(ii)  Obtain  t^a)  and  the  upper  a  •  100  percentile  points  of  sup_L<t<L  | V (< ) |  and 

suP-L+$<t<L-$  IW'WI.  respectively. 

The  bands 

K{-L,t)  ±  v(a>  ly/ZE  -  L  <  t  <  L 

and 

^(«-5.«  +  5)±®'*VVS?  - L+i<t<L-i 

are  then  asymptotic  simultaneous  confidence  bands. 

A  proof  of  weak  convergence  appears  extremely  difficult.  Although  desirable  from  a  theo¬ 
retical  point  of  view,  weak  convergence  is  not  useful  statistically  unless  the  distribution  of  the 
supremum  of  the  absolute  value  of  the  limiting  process  can  be  obtained.  Unfortunately,  this  is  in 
general  a  very  difficult  problem  even  if  the  Gaussian  process  is  stationary  (see  Cressie  and  Davis, 
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